DNA insert
3. Excise pBK-CMV phagemid
containing cloned DNA insert
by co-infection with helper phage

f1 origin (D + TT)

SV40 polyA
BK Reverse Primer                  T3 Primer

5' ACAGGAAACAGCTATGACCTTG 3'    5' AATTAACCCTCACTAAAGGG 3'



1200 MET   T3 promoter +1   Sac I   BssH II   Sal I   Spe I   BamH I   EcoR I

5' tcacacaggaaacagctatgaccttgattacgccaagctcgaaattaaccctcactaaagggaacaaaagctggagctcgcgcgcctgcaggtcgacactagtggatccaaag

3' AGTGTGTCCTTTGTCGATACTGGAACTAATGCGGTTCGAGCTTTAATTGGGAGTGATTTCCCTTGTTTTCGACCTCGAGCGCGCGGACGTCCAGCTGTGATCACCTAGGTTTCTTAA

lias 

1 jjw i| flsploe I ascxi 

Hind III   Sca I   Xba I   Not I   Apa I   Cla I   Sma I   Kpn I

AATTCAAAAAGCTTCTCGAGAGTACTTCTAGAGCGGCCGCGGGCCCATCGATTTTCCACCCGGGTGGGGTACCAGGTAAGTGTACCCAATTCGCCCTATAGTGAGTCGTATTACAATTCA 3' (+)

GTTTTTCGAAGAGCTCTCATGAAGATCTCGCCGGCGCCCGGGTAGCTAAAAGGTGGGCCCACCCCATGGTCCATTCACATGGGTTAAGCGGGATATCACTCAGCATAATGTTAAGT 5' (-)

+1 T7 promoter

3' CGGGATATCACTCAGCATAATG 5'    3' TGACCGGCAGCAAAATG 5'
T7 Primer                       M13 -20 Primer
DNA sequence of Tm 13.17 cDNA clone

       BamH I   EcoR I
  1 AGTGGATCCAAAGAATTCGGCACGAGACTACTAAGATGAAGTTGCTCTGTTGTCTAATCT
    MKLLCCLIS

 61 CCCTCATTCTGTTGGTCACAGTTCAGGCCCTGACCGAGGCACAAATTGAGAAACTGAACA
    LILLVTVQA ↓ LTEAQIEKLNK

121 AGATCAGCAAAAAATGTCAAAATGAAAGTGGAGTGTCGCAAGAGATCATAACCAAAGCTC
    ISKKCQNESGVSQEIITKAR

181 GCAACGGTGACTGGGAGGACGATCCTAAACTGAAACGCCAAGTTTTTTGCGTGGCCAGGA
    NGDWEDDPKLKRQVFCVARN

241 ACGCCGGTCTGGCCACGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA
    AGLATESGEVVVDVLREKVR

301 GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATCATCAATAAGTGCGCCGTCAAGA
    KVTDNDEETEKIINKCAVKR

361 GAGATACTGTTGAAGAGACGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA
    DTVEETVFNTFKCVMKNKPK

421 AGTTCTCACCAGTTGATTGAACCACCACGACTAGTAGATGGTTCAAATGGTGTGCTTTAC
    FSPVD*

                                                        Xho I
481 ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAACTCG
    polyadenylation signal    poly (A) tail (26)

537 AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC
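The one-letter translation printed beneath the nucleotide rows can be reproduced with a small codon-table lookup. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` is hypothetical, and the test fragment is the first eight codons of the open reading frame (ATG AAG TTG CTC TGT TGT CTA ATC) as they also appear in the His-tagged Tm 13.17 listing.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
# Assumption: the cDNA is translated with the standard (nuclear) code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping after the first stop codon."""
    dna = dna.upper().replace(" ", "")
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


# First eight codons of the Tm 13.17 open reading frame.
print(translate("ATG AAG TTG CTC TGT TGT CTA ATC"))
# Last codons of the reading frame, including the TGA stop.
print(translate("CCA GTT GAT TGA ACC"))
```

Spaces between codons are ignored, so rows can be pasted directly from the codon-spaced listings.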
A. Mature Tm 13.17 amino acid residues

1 LTEAQIEKLN KISKKCQNES GVSQEIITKA RNGDWEDDPK LKRQVFCVAR
51 NAGLATESGE VVVDVLREKV RKVTDNDEET EKIINKCAVK RDTVEETVFN
101 TFKCVMKNKP KFSPVD

B. Summary of the composition analysis for the mature Tm 13.17
sequence:

Residue     Number   Mole Percent
A = Ala        6        5.172
B = Asx        0        0.000
C = Cys        4        3.448
D = Asp        8        6.897
E = Glu       13       11.207
F = Phe        4        3.448
G = Gly        4        3.448
H = His        0        0.000
I = Ile        6        5.172
K = Lys       16       13.793
L = Leu        5        4.310
M = Met        1        0.862
N = Asn        8        6.897
P = Pro        3        2.586
Q = Gln        4        3.448
R = Arg        6        5.172
S = Ser        5        4.310
T = Thr        8        6.897
V = Val       14       12.069
W = Trp        1        0.862
Y = Tyr        0        0.000
Z = Glx        0        0.000

Molecular weight = 13171.96; Residues = 116; Average Residue Weight = 113.55
Charge = 1; Isoelectric point = 7.74.
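The Number and Mole Percent columns follow directly from the mature sequence in part A: each count is divided by the 116 residues. A minimal sketch of that arithmetic is given below; the function name `composition` is hypothetical, and the sequence is typed in from part A (with the seventh block read as VVVDVLREKV).

```python
from collections import Counter

# Mature Tm 13.17 sequence from part A, in blocks of ten residues.
MATURE_TM_13_17 = (
    "LTEAQIEKLN" "KISKKCQNES" "GVSQEIITKA" "RNGDWEDDPK" "LKRQVFCVAR"
    "NAGLATESGE" "VVVDVLREKV" "RKVTDNDEET" "EKIINKCAVK" "RDTVEETVFN"
    "TFKCVMKNKP" "KFSPVD"
)


def composition(seq):
    """Return {residue: (count, mole percent)} for a one-letter sequence."""
    counts = Counter(seq)
    total = len(seq)
    return {aa: (n, round(100.0 * n / total, 3))
            for aa, n in sorted(counts.items())}


table = composition(MATURE_TM_13_17)
for aa, (n, pct) in table.items():
    print(f"{aa}  {n:3d}  {pct:7.3f}")

# Average residue weight from the reported molecular weight.
print(round(13171.96 / len(MATURE_TM_13_17), 2))
```

Residues that do not occur (His, Tyr) are simply absent from the returned dictionary, which corresponds to the zero rows of the table.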
2-2

  1 GGCACGAGCAAAA[ATG]AAACTCCTCTTGTGCTTTGCGTTCGCCGCC
    MKLLLCFAFAA

 47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
    IVIGAQA ↓ LTDEQIQK

 92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC
    RNKISKECQQVSGVS

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGATGAT
    QETIDKVRTGVLVDD

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA
    PKMKKHVLCFSKKTG

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC
    VATEAGDTNVEVLKA

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG
    KLKHVASDEEVDKIV

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
    QKCVVKKATPEETAY

361 GACACCTTCAAGTGTATTTACGACAGCAAACCTGATTTCTCTCCT
    DTFKCIYDSKPDFSP

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTTGACAATAAAGGT
    ID*

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



FIG 3.0 



2-3 



  1 GGCACGAGCAAAA[ATG]AAACTCCTCTTGTGCTTTGCTTTCGCCGCC
    MKLLLCFAFAA

 47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA
    IVIGAQA ↓ LTDEQIQK

 92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC
    RNKISKECQQVSGVS

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT
    QETIDKVRTGVLVDD

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA
    PKMKKHVLCFSKKTG

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC
    VATEAGDTNVEVLKA

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG
    KLKHVASDEEVDKIV

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT
    QKCVVKKATPEETAY

361 GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT
    DTFKCIYDSKPDFSP

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTTGACAATAAAGGT
    ID*

polyadenylation signal 

451 ACTATCGTTATGAAAAAAAAAAAAAAAAAA 



poly (A) tail 
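The two 3' features labelled in these listings, the polyadenylation signal and the poly (A) tail, can be located programmatically. The sketch below assumes the canonical AATAAA hexamer is the labelled signal and that the tail is the uninterrupted run of A at the 3' end; both function names are hypothetical, and the test string is the 3' end of clone 2-3 as read from the listing (18 terminal A residues).

```python
def find_polya_signal(seq, signal="AATAAA"):
    """Return the 1-based position of the first signal hexamer, or None."""
    idx = seq.upper().find(signal)
    return None if idx < 0 else idx + 1


def polya_tail_length(seq):
    """Count the uninterrupted run of A residues at the 3' end."""
    seq = seq.upper()
    return len(seq) - len(seq.rstrip("A"))


# 3' end of clone 2-3, starting at the row numbered 406 in the listing.
three_prime = ("ATTGATTAATTGTTTTGTATTTGACTGAATTTTGACAATAAAGGT"
               "ACTATCGTTATG" + "A" * 18)

offset = find_polya_signal(three_prime)
print(offset)                      # position within this fragment
print(406 + offset - 1)            # position in the figure's numbering
print(polya_tail_length(three_prime))
```

Because the cloned tail length depends on where the oligo(dT) primer annealed, the counted length describes the clone rather than the native transcript.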
                  start
2-2  GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCG
2-3  GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCT

2-2  TTCGCCGCCATCGTCATCGGAGCTCAGGCTCTCACCG
2-3  TTCGCCGCCATCGTCATCGGAGCTCAGGCTCTCACCG

2-2  ACGAACAGATACAGAAAAGGAACAAGATCAGCAAAGA
2-3  ACGAACAGATACAGAAAAGGAACAAGATCAGCAAAGA

2-2  ATGCCAGCAGGTGTCCGGAGTGTCCCAAGAGACGATC
2-3  ATGCCAGCAGGTGTCCGGAGTGTCCCAAGAGACGATC

2-2  GACAAAGTCCGCACAGGTGTCTTGGTCGATGATCCCA
2-3  GACAAAGTCCGCACAGGTGTCTTGGTCGACGATCCCA

2-2  AAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAAC
2-3  AAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAAC

2-2  TGGAGTGGCAACCGAAGCCGGAGACACCAATGTGGAG
2-3  TGGAGTGGCAACCGAAGCCGGAGACACCAATGTGGAG

2-2  GTACTCAAAGCCAAGCTGAAGCATGTGGCCAGCGACG
2-3  GTACTCAAAGCCAAGCTGAAGCATGTGGCCAGCGACG

2-2  AAGAGGTGGACAAGATCGTGCAGAAGTGCGTGGTCAA
2-3  AAGAAGTGGACAAGATCGTGCAGAAGTGCGTGGTCAA

2-2  GAAGGCCACACCAGAGGAAACGGCTTATGACACCTTC
2-3  GAAGGCCACACCAGAGGAAACGGCTTATGACACCTTC

2-2  AAGTGTATTTACGACAGCAAACCTGATTTCTCTCCTA
2-3  AAGTGTATTTACGACAGTAAACCTGATTTCTCTCCTA
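An alignment of this kind can be checked position by position. The sketch below is one simple way to list mismatches between two already-aligned sequences of equal length; the function name `differences` is hypothetical, and the two test strings are the first rows (bases 1 to 46) of the clone 2-2 and clone 2-3 listings.

```python
def differences(a, b):
    """List (1-based position, base in a, base in b) for every mismatch."""
    if len(a) != len(b):
        raise ValueError("sequences must already be aligned to equal length")
    return [(i, x, y)
            for i, (x, y) in enumerate(zip(a.upper(), b.upper()), start=1)
            if x != y]


# Bases 1-46 of each clone, as given in the 2-2 and 2-3 listings.
clone_2_2 = "GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCGTTCGCCGCC"
clone_2_3 = "GGCACGAGCAAAAATGAAACTCCTCTTGTGCTTTGCTTTCGCCGCC"

print(differences(clone_2_2, clone_2_3))
```

In this stretch the single mismatch falls in the third position of an alanine codon (GCG versus GCT), so the encoded residue is unchanged.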
Predicted Amino Acid
Composition of 2-2 and 2-3

Analysis                        Whole Protein
Molecular Weight                12843.80 m.w.
Length                          115
1 microgram =                   77.859 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.22 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Amino Acid(s)           Number count   % by weight   % by frequency
Charged (RKHYCDE)           48            47.19          41.74
Acidic (DE)                 20            18.90          17.39
Basic (KR)                  20            20.40          17.39
Polar (NCQSTY)              30            25.35          26.09
Hydrophobic (AILFWV)        34            27.26          29.57
A Ala                        6             3.32           5.22
C Cys                        4             3.21           3.48
D Asp                       11             9.86           9.57
E Glu                                      9.05           7.83
F Phe                                      3.44           2.61
G Gly                                      1.78           3.48
H His
I Ile                                      5.29
K Lys                       18            17.97
L Leu                        5
M Met                                      1.02           0.87
N Asn                                      1.78           1.74
P Pro                                      3.02           3.48
Q Gln                        6             5.98           5.22
R Arg                        2             2.43           1.74
S Ser                        7             4.75           6.09
T Thr                        9             7.08           7.83
V Val                       14            10.80          12.17
W Trp                                      0.00           0.00
Y Tyr                                      2.54           1.74
B Asx                                      0.00           0.00
Z Glx                                      0.00           0.00
X Xxx                                                     0.00
. Ter
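The grouped rows of this table (Charged, Acidic, Basic, Polar, Hydrophobic) are sums over the residue classes named in parentheses, with percent by frequency taken over the 115 residues. A minimal sketch follows, assuming the mature clone 2-2 protein is the sequence given in the His-tagged clone 2.2 listing (Leu 1 to Asp 115); the names `GROUPS` and `group_summary` are hypothetical.

```python
# Mature clone 2-2 protein, transcribed from the three-letter translation
# in the His-tagged clone 2.2 listing (residues 1 to 115).
MATURE_2_2 = (
    "LTDEQIQKRNKIS" "KECQQVSGVSQETID" "KVRTGVLVDDPKMKK" "HVLCFSKKTGVATEA"
    "GDTNVEVLKAKLKHV" "ASDEEVDKIVQKCVV" "KKATPEETAYDTFKC" "IYDSKPDFSPID"
)

# Residue classes exactly as labelled in the table.
GROUPS = {
    "Charged": "RKHYCDE",
    "Acidic": "DE",
    "Basic": "KR",
    "Polar": "NCQSTY",
    "Hydrophobic": "AILFWV",
}


def group_summary(seq):
    """Return {group: (count, percent by frequency)} for each residue class."""
    total = len(seq)
    summary = {}
    for name, members in GROUPS.items():
        count = sum(seq.count(aa) for aa in members)
        summary[name] = (count, round(100.0 * count / total, 2))
    return summary


for name, (count, pct) in group_summary(MATURE_2_2).items():
    print(f"{name:12s} {count:3d} {pct:6.2f}")
```

Note that the table's "Charged" class includes His, Tyr and Cys in addition to the acidic and basic residues, so it is larger than Acidic plus Basic.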
FIG 4.1




FIG 4.2 
Tm 13.17 cDNA

  1 AGTGGATCCAAAGAATTCGGCACGAGACTACTAAGATGAAGTTGCTCTGTTGTCTAATCT
    MKLLCCLIS

 61 CCCTCATTCTGTTGGTCACAGTTCAGGCCCTGACCGAGGCACAAATTGAGAAACTGAACA
    LILLVTVQA ↓ LTEAQIEKLNK
    ↑ Forward Primer

121 agatcagcaaaaaatgtcaaaatgaaagtggagtgtcgcaagagatcataaccaaagctc
    iskkcqnesgvsqeiitkar

181 gcaacggtgactgggaggacgatcctaaactgaaacgccaagttttttgcgtggccagga
    ngdweddpklkrqvfcvarn

241 acgccggtctggccacggaatcgggagaggtggtggtcgacgtgttgagggagaaggtga
    aglatesgevvvdvlrekvr

301 ggaaggtcactgacaacgacgaagaaactgagaaaatcatcaataagtgcgccgtcaaga
    kvtdndeetekiinkcavkr
    Reverse Primer

361 GA[GATACTGTTGAAGAGA]CGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA
    DTVEETVFNTFKCVMKNKPK

421 AGTTCTCACCAGTTGATTGAACCACCACGACTAGTAGATGGTTCAAATGGTGTGCTTTAC
    FSPVD*

481 ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAACTC

FIG. 4.6 a
             Percent (%) composition
Primer       A      C      G      T     Melting Temperature (°C)

Forward     28.6   14.3   42.9   14.3   44.0
Reverse     25.0   31.3    6.3   37.5   44.0



FIG 4.6 
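The percent composition and melting temperature columns can be recomputed from a primer sequence. The sketch below rests on two assumptions that the figure does not state: that the reverse primer is the reverse complement of the 16-base stretch GATACTGTTGAAGAGA marked under "Reverse Primer" in FIG. 4.6 a, and that the melting temperature was estimated with the simple rule Tm = 2(A+T) + 4(G+C). Under those assumptions the values agree with the Reverse row of the table (the table rounds 31.25 and 6.25 up to 31.3 and 6.3). All function names are hypothetical.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def base_percentages(seq):
    """Percent of each base, unrounded."""
    seq = seq.upper()
    return {base: 100.0 * seq.count(base) / len(seq) for base in "ACGT"}


def rule_of_thumb_tm(seq):
    """Tm in deg C by the 2(A+T) + 4(G+C) rule for short oligonucleotides."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Assumed reverse primer site: the marked stretch in row 361 of FIG. 4.6 a.
marked_site = "GATACTGTTGAAGAGA"
reverse_primer = reverse_complement(marked_site)

print(reverse_primer)
print(base_percentages(reverse_primer))
print(rule_of_thumb_tm(reverse_primer))
```

The forward primer sequence cannot be read from the figure text, so only the reverse primer is checked here.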




FIG 4.7 




FIG 4.8 




FIG 4.9 



3-4 



1 GGCACGAGCAAAA lATGl A AACTCCTCTTGTGCTTTGCTTTCGCCQCG 

MKLLLGFAFAA 

47 ATCGTCATCGGAGCTCAGGCTCTGACGGACGAACAGATACAGAAA 
1 VI GAGA jLL T D E Q I Q K 

92 AGGAAGAAGATCAGGAAAGAATGCGAGGAGGTGTGCGGAGTGTGC 
RNKI SKEGQQVSGVS 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QETI DKVRTGVLVDD 

182 CGCAAAATGAAGAAGCACGTCGTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCFSKKTG 

226 GTGGGAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG 
KLKHVASDEEVDKi V 

316 CAGAAGTGGGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKGVVKKATPEETAY 

361 GACACCTTCAAGGTTATTTACGACAGTAAACCTGATTTCTCTCCT 
D TFKVi YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTT GAC AATAAA GGT 
I D * 

polyadenylation signal 

451 ACTATGGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



FIG. 4.10 a 



Predicted Amino Acid
Composition of 3-4

Analysis                        Whole Protein
Molecular Weight                12839.70 m.w.
Length                          115
1 microgram =                   77.883 pMoles
Molar Extinction coefficient    2920±5%
1 A(280) =                      4.40 mg/ml
Isoelectric Point               7.14
Charge at pH 7                  0.16

Amino Acid(s)           Number count   % by weight   % by frequency
Charged (RKHYCDE)           47            46.41          40.87
Acidic (DE)                 20            18.91          17.39
Basic (KR)                  20            20.41          17.39
Polar (NCQSTY)              29            24.55          25.22
Hydrophobic (AILFWV)        35            28.04          30.43
A Ala                        6             3.32           5.22
C Cys                        3             2.41           2.61
D Asp                       11             9.86           9.57
E Glu                        9             9.05           7.83
F Phe                        3             3.44           2.61
G Gly                        4             1.78           3.48
H His                        2             2.14           1.74
I Ile                        6             5.29           5.22
K Lys                       18            17.97          15.65
L Leu                        5             4.41           4.35
M Met                        1             1.02           0.87
N Asn                                      1.78           1.74
P Pro                        4             3.02           3.48
Q Gln                                      5.99           5.22
R Arg                        2             2.43           1.74
S Ser                                      4.75           6.09
T Thr                                      7.09           7.83
V Val                       15            11.58          13.04
W Trp                        0             0.00           0.00
Y Tyr                                      2.54           1.74
B Asx                                      0.00           0.00
Z Glx                        0             0.00           0.00
X Xxx                        0             0.00           0.00
. Ter                                      0.00           0.00



FIG. 4.10 b 
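Two rows of each "Analysis" summary are simple functions of the molecular weight and the molar extinction coefficient: the number of picomoles in one microgram, and the concentration that gives an absorbance of 1 at 280 nm. The sketch below recomputes both, assuming the extinction coefficient is in M^-1 cm^-1 and a 1 cm path length (neither is stated in the figures); the function names are hypothetical and the inputs are the values printed in the three composition summaries.

```python
def picomoles_per_microgram(mol_weight):
    """pmol in 1 ug: (1e-6 g / MW g/mol) * 1e12 pmol/mol = 1e6 / MW."""
    return 1e6 / mol_weight


def mg_per_ml_at_a280_of_one(mol_weight, extinction):
    """Concentration giving A(280) = 1, assuming a 1 cm path.

    Beer-Lambert: A = e * c * l, so c = 1 / e in mol/L, which is
    MW / e in g/L, and g/L is numerically equal to mg/ml.
    """
    return mol_weight / extinction


# (molecular weight, molar extinction coefficient) as printed in the figures.
summaries = {
    "2-2, 2-3 and 7-5": (12843.80, 3040),
    "3-4": (12839.70, 2920),
    "3-9": (12871.80, 3040),
}

for clone, (mw, ext) in summaries.items():
    print(clone,
          round(picomoles_per_microgram(mw), 3),
          round(mg_per_ml_at_a280_of_one(mw, ext), 2))
```

The recomputed values match the printed rows for all three summaries, which also serves as a check on the transcribed molecular weights.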



3-9 



1 GGCACGAGCAAAA[ATiA AACTCCTCTTGT GCTTTGCTTTCGCCGCC 



M LLLCFAFAA 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGATGAACAGATACAGAAA 
I VI G A Q A i|^L T D E Q I Q K 

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGAGTCCGGAGTGTGC 
RNKl SKECQQESGVS 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTGGACGAT 
QETI DKVRTGVLVDD 

182 CCCAAAATGAAGAAGCAGGTCCTCTGCTTCTCGAAGAGAACTGGA 
PKMKKHVLCFSKRTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTAGTCAAAGCC 
VATEAGDTNVEVLKA 

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

316 CAGAAGTGGGTGGTGAAGAAGGCCACACGAGAGGAAACGGCTTAT 
QKCVVKKATPEETAY 

361 GACACGTTGAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 
DTFKVI YDSKPDFSP 

406 ATTGATT AATT GTTTTGT ATTTGACT GAATTTT GACA.ATA_A^GGT 

' D • . . , 

polyadenylation signal 

451 ACTATCGTTATGAAAAAAAAAAAAAAAAAA 

poly (A) tail 



FIG. 4.11 a 



Predicted Amino Acid
Composition of 3-9

Analysis                        Whole Protein
Molecular Weight                12871.80 m.w.
Length                          115
1 microgram =                   77.689 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.23 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Whole Protein Composition Analysis

Amino Acid(s)           Number count   % by weight   % by frequency
Charged (RKHYCDE)           48            47.31          41.74
Acidic (DE)                 20            18.86          17.39
Basic (KR)                                               17.39
Polar (NCQSTY)                            25.29          26.09
Hydrophobic (AILFWV)        34            27.20          29.57
A Ala                                      3.31           5.22
C Cys                                      3.21           3.48
D Asp                       11                            9.57
E Glu                                                     7.83
F Phe                                                     2.61
G Gly
H His
I Ile
K Lys
P Pro                        4             3.02           3.48
Q Gln
R Arg
S Ser
T Thr

FIG. 4.11 b



7-5 



1 GGCACGAGCAAA A |AT Gl A AACTCCTCTTGTGCTTTGCGTTCGCCGCC 

MKLLLCFAFAA 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA 
I VI G A Q A T D E Q I Q K 

92 AGGAACAAGATCAGCAAAGAGTGCCAGCAGGTGTCCGGAGTGTCC 
RNKl SKECQQESGVS 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QETI DKVRTGVLVDD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCFSKRTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVL KA 

271 AAGCTGAAGCATQTGGCCAGCGACGAAGAAGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKATPEETAY 

361 GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 
DTFKVI YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGGCTGAATTTT GAC AAT A AA GGT 
1 D * 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



FIG. 4.12 a 



Predicted Amino Acid
Composition of 7-5

Analysis                        Whole Protein
Molecular Weight                12843.80 m.w.
Length                          115
1 microgram =                   77.859 pMoles
Molar Extinction coefficient    3040±5%
1 A(280) =                      4.22 mg/ml
Isoelectric Point               7.11
Charge at pH 7                  0.13

Whole Protein Composition Analysis

Amino Acid(s)           Number count   % by weight   % by frequency
Charged (RKHYCDE)           48            47.19          41.74
Acidic (DE)                 20            18.90          17.39
Basic (KR)                  20            20.40          17.39
Polar (NCQSTY)              30            25.35          26.09
Hydrophobic (AILFWV)        34            27.26          29.57
A Ala                        6             3.32           5.22
C Cys                        4             3.21           3.48
D Asp                       11             9.86           9.57
E Glu                        9             9.05           7.83
F Phe                        3             3.44           2.61
G Gly                        4             1.78           3.48
H His                        2             2.14           1.74
I Ile                        6             5.29           5.22
K Lys                       18            17.97          15.65
L Leu                        5             4.41           4.35
M Met                        1             1.02           0.87
N Asn                        2             1.78           1.74
P Pro                        4             3.02           3.48
Q Gln                        6             5.98           5.22
R Arg                        2             2.43           1.74
S Ser                        7             4.75           6.09
T Thr                        9             7.08           7.83
V Val                       14            10.80          12.17
W Trp                        0             0.00           0.00
Y Tyr                        2             2.54           1.74
B Asx                                      0.00           0.00
Z Glx                                      0.00           0.00
X Xxx                                      0.00           0.00
. Ter                                      0.00           0.00



FIG. 4.12 b 
His-tagged Clone 2.2 with signal sequence



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96
Met Gly Ser Ser His His His His His His Ser 
-55 -50 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-45 -40 -35 

AFP Start Codon 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186 
Gly Gln Gln Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met
-30 -25 -20

AAA CTC CTC TTG TGC TTT GCG TTC GCC GCC ATC GTC ATC GGA GCT 231
Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala Ile Val Ile Gly Ala
-15 -10 -5

N-terminal of mature AFP
CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 276
Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser
1 5 10

AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321
Lys Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp
15 20 25

AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 366

Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys 
30 35 40 

CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411 
His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala 
45 50 55 

GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456 
Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val 
60 65 70 

GCC AGC GAC GAA GAG GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501 
Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys Val Val
75 80 85 

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546 
Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys 
90 95 100 

Stop Codon 

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 595 
Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp *
105 110 115 

Polyadenylation signal Poly-A tail 
TTTGACTGAA TTTTGAC AAT AAA GGTAATA TCGTTATGTA AAAAAAAAAA 645 

AAAAAACTCG AGCACCACCA CCACCACCAC TGAGAT 681 
His-tagged clone 2.2 without signal sequence



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96
Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 186
Gly Gln Gln Met Gly Arg Gly Ser Leu Thr Asp Glu Gln Ile Gln
-5 1 5

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 231
Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val
10 15 20

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 276
Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp
25 30 35 

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 321 
Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr 
40 45 50 

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 366
Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys 
55 60 65 

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAG GTG GAC AAG ATC 411 
Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile
70 75 80 

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 456 
Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala
85 90 95 

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 501 
Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser
100 105 110 

Stop Codon 

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro Ile Asp *
115 
His-tagged clone 2.3 with signal sequence



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96
Met Gly Ser Ser His His His His His His Ser
-55 -50 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-45 -40 -35 

AFP Start Codon 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186 
Gly Gln Gln Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met
-30 -25 -20 

AAA CTC CTC TTG TGC TTT GCT TTC GCC GCC ATC GTC ATC GGA GCT 231 
Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala Ile Val Ile Gly Ala
-15 -10 -5

N-terminal of mature AFP
CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 276
Gln Ala Leu Thr Asp Glu Gln Ile Gln Lys Arg Asn Lys Ile Ser
1 5 10

AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321
Lys Glu Cys Gln Gln Val Ser Gly Val Ser Gln Glu Thr Ile Asp
15 20 25

AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 366

Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys 
30 35 40 

CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411 
His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala 
45 50 55 

GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456 
Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val 
60 65 70 

GCC AGC GAC GAA GAA GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501 
Ala Ser Asp Glu Glu Val Asp Lys Ile Val Gln Lys Cys Val Val
75 80 85 

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546 

Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys 
90 95 100 

Stop Codon 

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 595 

Ile Tyr Asp Ser Lys Pro Asp Phe Ser Pro Ile Asp *
105 110 115 

Polyadenylation signal Poly-A tail 
TTTGACTGAA TTTTGAC AAT AAA GGTACTA TCGTTATGAA AAAAAAAAAA 645 

AAAAAAACTC GAGCACCACC ACCACCACCA CTGAGAT 682 
His-tagged Clone 2.3 without signal sequence



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 

AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96

Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 186 
Gly Gln Gln Met Gly Arg Gly Ser Leu Thr Asp Glu Gln Ile Gln
-5 1 5

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 231
Lys Arg Asn Lys Ile Ser Lys Glu Cys Gln Gln Val Ser Gly Val
10 15 20

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 276
Ser Gln Glu Thr Ile Asp Lys Val Arg Thr Gly Val Leu Val Asp
25 30 35 

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 321 
Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr 
40 45 50 

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 366
Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys 
55 60 65 

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAA GTG GAC AAG ATC 411 
Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys Ile
70 75 80 

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 456 
Val Gln Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala
85 90 95 

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 501 
Tyr Asp Thr Phe Lys Cys Ile Tyr Asp Ser Lys Pro Asp Phe Ser
100 105 110 

Stop Codon 

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro Ile Asp *
115 



FIG. 5.10 



His-tagged Tm 13.17 with signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-65 -60 -55 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-50 -45 -40 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC TGG ATC CAA AGA ATT 186 
Gly Gln Gln Met Gly Arg Gly Ser Glu Phe Trp Ile Gln Arg Ile
-35 -30 -25 

AFP Start Codon 

CGG CAC GAG ACT ACT AAG ATG AAG TTG CTC TGT TGT CTA ATC TCC 231 
Arg His Glu Thr Thr Lys Met Lys Leu Leu Cys Cys Leu Ile Ser
-20 -15 -10 

N-terminal of mature AFP 
CTC ATT CTG TTG GTC ACA GTT CAG GCC CTG ACC GAG GCA CAA ATT 276
Leu Ile Leu Leu Val Thr Val Gln Ala Leu Thr Glu Ala Gln Ile
-5 1 5

GAG AAA CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA 321
Glu Lys Leu Asn Lys Ile Ser Lys Lys Cys Gln Asn Glu Ser Gly
10 15 20

GTG TCG CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG 366
Val Ser Gln Glu Ile Ile Thr Lys Ala Arg Asn Gly Asp Trp Glu
25 30 35

GAC GAT CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC 411
Asp Asp Pro Lys Leu Lys Arg Gln Val Phe Cys Val Ala Arg Asn
40 45 50 

GCC GGT CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG 456 
Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu 
55 60 65 

AGG GAG AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG 501
Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu
70 75 80 

AAA ATC ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG 546 
Lys Ile Ile Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu
85 90 95 

ACG GTG TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG 595 
Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys 
100 105 110 

Stop Codon 

TTC TCA CCA GTT GAT TGA ACCACCACGA CTAGTAGATG GTTCAAATGG 643 
Phe Ser Pro Val Asp * 
115 

Polyadenylation signal Poly-A tail 
TGTGCTTTAC ATATAA AAAT AAA GTGTTTC TGATGTAAAA AAAAAAAAAA 693 

AAAAAAAAAA AACTCGAGAG TATTCTAGAG CGGCCGCGGG CCCATCGTTT 743 

TCCACCCCTC GAGCACCACC ACCACCACCA CTGAGAT 777 
His-tagged Tm 13.17 without signal sequence



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96
Met Gly Ser Ser His His His His His His Ser
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP
GGA CAG CAA ATG GGT CGC GGC CTG ACC GAG GCA CAA ATT GAG AAA 186
Gly Gln Gln Met Gly Arg Gly Leu Thr Glu Ala Gln Ile Glu Lys
-5 1 5

CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA GTG TCG 231
Leu Asn Lys Ile Ser Lys Lys Cys Gln Asn Glu Ser Gly Val Ser
10 15 20

CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG GAC GAT 276
Gln Glu Ile Ile Thr Lys Ala Arg Asn Gly Asp Trp Glu Asp Asp
25 30 35

CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC GCC GGT 321
Pro Lys Leu Lys Arg Gln Val Phe Cys Val Ala Arg Asn Ala Gly
40 45 50 

CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG AGG GAG 366
Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu Arg Glu
55 60 65 

AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG AAA ATC 411 
Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu Lys Ile
70 75 80 

ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG ACG GTG 456 
Ile Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu Thr Val
85 90 95 

TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG TTC TCA 501 
Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys Phe Ser 
100 105 110 

Stop Codon 

CCA GTT GAT TGA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro Val Asp * 
115 